Journal article

Automatically Determining the Number of Clusters in Unlabeled Data Sets

Liang Wang, Christopher Leckie, Kotagiri Ramamohanarao, James Bezdek

IEEE Transactions on Knowledge and Data Engineering | Institute of Electrical and Electronics Engineers | Published : 2009

Abstract

One of the major problems in cluster analysis is the determination of the number of clusters in unlabeled data, which is a basic input for most clustering algorithms. In this paper, we investigate a new method called Dark Block Extraction (DBE) for automatically estimating the number of clusters in unlabeled data sets, which is based on an existing algorithm for Visual Assessment of Cluster Tendency (VAT) of a data set, using several common image and signal processing techniques. Its basic steps include 1) generating a VAT image of an input dissimilarity matrix, 2) performing image segmentation on the VAT image to obtain a binary image, followed by directional morphological filtering, 3) app..

View full abstract

University of Melbourne Researchers